
    Utilizing Output in Web Application Server-Side Testing

    This thesis investigates the use of web application output to enhance automated server-side code testing. The server-side code is the main driving force of a web application: it generates client-side code, maintains state and communicates with back-end resources. The output observed in these elements is a valuable resource that can potentially improve the efficiency and effectiveness of automated testing. The thesis explores the use of this output in test data generation, test sequence regeneration, augmentation and test case selection. It also addresses the web-specific challenges that arise when applying search based test data generation algorithms to web applications and when applying dataflow analysis of state variables to test sequence regeneration. The thesis presents three tools and four empirical studies to implement and evaluate the proposed approaches. SWAT (Search based Web Application Tester) is a first application of search based test data generation algorithms to web applications; it uses values dynamically mined from the intermediate and client-side output to enhance the search based algorithm. SART (State Aware Regeneration Tool) uses dataflow analysis of state variables, session state and database tables, and their values, to regenerate new request sequences from existing sequences. SWAT-U (SWAT-Uniqueness) augments test suites with test cases that produce outputs not observed in the original test suite's output. Finally, the thesis presents an empirical study of the correlation between new output based test selection criteria and both fault detection and structural coverage. The results confirm that using the output does indeed enhance the effectiveness and efficiency of search based test data generation, and improves test suites' effectiveness for test sequence regeneration and augmentation. The results also show that output uniqueness criteria are strongly correlated with both fault detection and structural coverage, and are complementary to structural coverage.
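
    The seeding idea behind SWAT can be illustrated in a few lines. The following is a minimal Python sketch, not SWAT itself: string literals mined from the application's HTML output seed the initial population of a simple search. The HTML snippet, the toy branch-distance fitness and all names are illustrative assumptions.

        import random
        import re
        import string

        # Minimal sketch (assumed names, not SWAT's API): values mined from the
        # application's client-side output seed a search-based input generator.

        def mine_seed_values(html):
            """Harvest string literals from generated client-side output."""
            return set(re.findall(r'value="([^"]+)"', html))

        def random_string(n=8):
            return "".join(random.choice(string.ascii_letters) for _ in range(n))

        def distance(candidate, wanted):
            """Toy branch-distance fitness: 0 when the comparison would succeed."""
            return sum(a != b for a, b in zip(candidate, wanted)) \
                + abs(len(candidate) - len(wanted))

        def search(seeds, wanted, budget=500):
            # Mined values join random strings in the initial population; a value
            # the server-side code compares against is often already optimal.
            population = list(seeds) + [random_string() for _ in range(20)]
            for _ in range(budget):
                population.sort(key=lambda c: distance(c, wanted))
                best = population[0]
                if distance(best, wanted) == 0:
                    return best
                # Mutate the fittest candidate by flipping one character.
                mutant = list(best or "a")
                i = random.randrange(len(mutant))
                mutant[i] = random.choice(string.ascii_letters)
                population.append("".join(mutant))
            return None

        html = '<form><input name="city" value="London"></form>'
        print(search(mine_seed_values(html), wanted="London"))  # -> 'London'

    With the mined seed present, the search terminates immediately; without it, the mutation loop must discover the value character by character, which is the efficiency gap the thesis exploits.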

    Detecting malware with information complexity

    Malware concealment is the predominant strategy for malware propagation. Black hats create variants of malware based on polymorphism and metamorphism. Malware variants, by definition, share some information. Although the concealment strategy alters this information, patterns remain in the software. Given a zoo of labelled malware and benign-ware, we ask whether a suspect program is more similar to our malware or to our benign-ware. Normalized Compression Distance (NCD) is a generic metric that measures the shared information content of two strings. This measure opens a new front in the malware arms race, one where the countermeasures promise to be more costly for malware writers, who must now obfuscate patterns as strings qua strings, without reference to execution, in their variants. Our approach classifies disk-resident malware with 97.4% accuracy and a false positive rate of 3%. We demonstrate that its accuracy can be improved by combining NCD with the compressibility rates of executables using decision forests, paving the way for future improvements. We demonstrate that malware reported within a narrow time frame of a few days is more homogeneous than malware reported over two years, but that our method still classifies the latter with 95.2% accuracy and a 5% false positive rate. Due to its use of compression, the time and computation cost of our method is nontrivial. We show that simple approximation techniques can improve its running time by up to 63%. We compare our results to those of applying the 59 anti-malware programs used on the VirusTotal website to our malware. Our approach outperforms each one used alone and matches that of all of them used collectively.
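
    NCD is defined as NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C(s) is the length of s after compression and xy is concatenation. A minimal Python sketch, using zlib as a stand-in for the paper's off-the-shelf compressors and a 1-nearest-neighbour decision rule in place of the exact classifier evaluated above:

        import zlib

        def clen(data):
            """Compressed length C(.) of a byte string, using zlib level 9."""
            return len(zlib.compress(data, 9))

        def ncd(x, y):
            """Normalized Compression Distance: near 0 for near-identical
            inputs, near 1 for unrelated ones."""
            cx, cy = clen(x), clen(y)
            return (clen(x + y) - min(cx, cy)) / max(cx, cy)

        def classify(suspect, malware_zoo, benign_zoo):
            """Label a suspect by its nearest neighbour across both zoos."""
            d_mal = min(ncd(suspect, m) for m in malware_zoo)
            d_ben = min(ncd(suspect, b) for b in benign_zoo)
            return "malware" if d_mal < d_ben else "benign-ware"

        # Toy byte strings stand in for real disk-resident binaries.
        mal = [b"\xeb\xfepacker stub" * 40]
        ben = [b"hello world, regular program text " * 40]
        print(classify(b"\xeb\xfepacker stub" * 39 + b"!", mal, ben))  # -> malware

    Note that the measure never executes the suspect: it compares binaries purely as strings, which is why variants must obfuscate "strings qua strings" to evade it.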

    State aware test case regeneration for improving web application test suite coverage and fault detection

    This paper introduces two test case regeneration approaches for web applications: one uses standard def-use testing, but for state variables; the other uses a novel value-aware dataflow approach. Our overall approach is to combine requests from a test suite to form client-side request sequences, based on dataflow analysis of server-side session variables and database tables. We implemented our approach as a tool, SART (State Aware Regeneration Tool), and used it to evaluate the proposed approaches on four real-world web applications. Our results show that for all four applications, both server-side coverage and fault detection were statistically significantly improved. Even on relatively high quality test suites, our algorithms improve average coverage by 14.74% and fault detection by 9.19%.
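
    The chaining idea can be made concrete with a small sketch. Below, a hypothetical representation (assumed for illustration, not SART's) records which session variables and database tables each request defines and uses, and new sequences are regenerated by pairing each definer with each user of the same state:

        from collections import namedtuple

        # defs/uses hold server-side state names (session variables, tables).
        Request = namedtuple("Request", "name defs uses")

        def regenerate(test_suite):
            """Yield two-request sequences covering each state def-use pair."""
            for definer in test_suite:
                for user in test_suite:
                    shared = definer.defs & user.uses
                    if definer is not user and shared:
                        yield (definer.name, user.name, sorted(shared))

        suite = [
            Request("login.php?u=admin", defs={"session.user"}, uses=set()),
            Request("cart.php?add=42", defs={"db.cart"}, uses={"session.user"}),
            Request("checkout.php", defs=set(), uses={"session.user", "db.cart"}),
        ]
        for seq in regenerate(suite):
            print(seq)  # e.g. ('cart.php?add=42', 'checkout.php', ['db.cart'])

    The value-aware variant described in the abstract would additionally check that the concrete value written by the definer is one the user's code actually branches on; that refinement is omitted here.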

    Detecting Malware with Information Complexity

    This work focuses on a specific front of the malware detection arms race, namely the detection of persistent, disk-resident malware. We exploit normalised compression distance (NCD), an information-theoretic measure, applied directly to binaries. Given a zoo of labelled malware and benign-ware, we ask whether a suspect program is more similar to our malware or to our benign-ware. Our approach classifies malware with 97.1% accuracy and a false positive rate of 3%. We achieve our results with off-the-shelf compressors and a standard machine learning classifier, without any specialised knowledge. An end-user need only collect a zoo of malware and benign-ware and can then immediately apply our techniques. We apply statistical rigour to our experiments and our selection of data. We demonstrate that accuracy can be optimised by combining NCD with the compressibility rates of the executables. We demonstrate that malware reported within a narrower time frame of a few days is more homogeneous than malware reported over a longer one of two years, but that our method still classifies the latter with 95.2% accuracy and a 5% false positive rate. Due to the use of compression, the time and computation cost of our method is non-trivial. We show that simple approximation techniques can improve the time complexity of our approach by up to 63%. We compare our results to those of applying the 59 anti-malware programs used on the VirusTotal website to our malware. Our approach does better than any single one of them, as well as the 59 used collectively.
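
    One way to combine NCD with compressibility rates, as described above, is to feed both into a standard classifier. This is a minimal sketch under stated assumptions: scikit-learn's random forest stands in for the "standard machine learning classifier", NCDs to a few fixed anchor binaries serve as features, and toy byte strings replace real executables.

        import zlib
        from sklearn.ensemble import RandomForestClassifier

        def clen(b):
            return len(zlib.compress(b, 9))

        def ncd(x, y):
            cx, cy = clen(x), clen(y)
            return (clen(x + y) - min(cx, cy)) / max(cx, cy)

        def compressibility(b):
            """Compressed size over raw size; packed code compresses poorly."""
            return clen(b) / max(len(b), 1)

        def features(sample, anchors):
            """NCD to fixed anchor binaries plus the compressibility rate."""
            return [ncd(sample, a) for a in anchors] + [compressibility(sample)]

        # Toy stand-ins for real binaries; a real zoo holds labelled executables.
        anchors = [b"\x90" * 64, bytes(range(256))]
        zoo = [b"\x90" * 60 + b"evil", bytes(range(256)) * 2, b"\x90" * 63]
        labels = [1, 0, 1]  # 1 = malware, 0 = benign-ware

        clf = RandomForestClassifier(n_estimators=100, random_state=0)
        clf.fit([features(s, anchors) for s in zoo], labels)
        print(clf.predict([features(b"\x90" * 58 + b"evil", anchors)]))

    As the abstract notes, an end-user needs nothing beyond a labelled zoo: the features require no execution traces, disassembly or format-specific knowledge.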

    Optimised Realistic Test Input Generation Using Web Services

    We introduce a multi-objective formulation of service-oriented testing, focusing on the balance between service price and reliability. We experimented with NSGA2 for this problem, investigating the effect on performance and quality of composition size, topology and the number of services discovered. For topologies small enough for exhaustive search, we found that NSGA2 finds a Pareto front very near the true Pareto front (the fronts are a Euclidean distance of ≈ 0.00024 price-reliability points apart). Regarding performance, we find that composition size has the strongest effect, with smaller topologies consuming more machine time; a curious effect we believe is due to the influence of crowding distance. Regarding result quality, our results reveal that size and topology have more effect on the front found than the number of service choices discovered. As expected, the cost-reliability relationship (logarithmic, linear, exponential) is replicated in the front discovered when correlation is high, but as the price-reliability correlation decreases, we find fewer solutions on the front and the front becomes less smooth.
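
    The bi-objective trade-off can be seen in a brute-force sketch: minimise total composition price while maximising end-to-end reliability, and keep the non-dominated points. The service data and the series-composition reliability model below are invented for illustration, and a brute-force Pareto filter stands in for NSGA2 on instances small enough for exhaustive search.

        from itertools import product

        services = {  # candidate services per task: (price, reliability)
            "auth":    [(1.0, 0.90), (3.0, 0.99)],
            "payment": [(2.0, 0.95), (5.0, 0.999)],
        }

        def evaluate(choice):
            price = sum(p for p, _ in choice)
            reliability = 1.0
            for _, r in choice:
                reliability *= r  # services composed in series
            return price, reliability

        def dominates(a, b):
            """a dominates b: no worse on both objectives, better on one."""
            return a[0] <= b[0] and a[1] >= b[1] and a != b

        compositions = [evaluate(c) for c in product(*services.values())]
        front = [c for c in compositions
                 if not any(dominates(d, c) for d in compositions)]
        print(sorted(front))  # non-dominated (price, reliability) points

    Exhaustive enumeration like this is exactly what becomes infeasible as composition size grows, which is why the paper turns to NSGA2 and then measures how far its front sits from the true one.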

    Augmenting test suites effectiveness by increasing output diversity


    Some challenges for software testing research (invited talk paper)

    This paper outlines four open challenges for Software Testing in general, and Search Based Software Testing in particular, arising from our experience with the deployment of the Sapienz system at Facebook. The challenges may also apply more generally, thereby representing opportunities for the research community to further benefit from the growing interest in automated test design in industry.

    Generation of an induced pluripotent stem cell (iPSC) line (JUCTCi017-A) from a patient with limb-girdle muscular dystrophy (LGMD) due to a homozygous p.Leu287Serfs*14 mutation in the SGCB gene

    Limb-girdle muscular dystrophies (LGMDs) are a large group of heterogeneous genetic diseases characterized by muscle weakness. In this study, an induced pluripotent stem cell (iPSC) line was generated from an LGMD patient's skin dermal fibroblasts carrying a homozygous mutation in the Sarcoglycan Beta (SGCB) gene: chr4:52890221, c.859delC, p.Leu287Serfs*14. The reprogramming was carried out using Sendai viruses encoding the Yamanaka factors. The resulting iPSCs showed normal morphology and karyotype, expressed pluripotency markers, demonstrated the potential to differentiate in vitro into the three germ layers, and retained the disease-causing SGCB mutation. This iPSC line represents an ideal source of cells for the investigation of LGMD disease mechanisms.